Shared Speakerphone System for Multiple Devices in a Conference Room

ABSTRACT

A speakerphone system is shared with multiple participant devices of participants in a physical meeting that are using a web conferencing service. An active speaker is identified from the participants. The participant device of the active speaker is switched, such that the speakerphone system receives and renders audio of the active speaker. Video of the participant device of the active speaker is enabled, such that the web conferencing service displays the video to the participant devices.

BACKGROUND Field

This disclosure relates generally to videoconferencing, and morespecifically, to shared speakerphone system in a videoconferencingenvironment.

Related Art

Through information handling systems or devices, web based conferencingmeetings through remote or web based are held with participants atvarious locations. Remote or web based conferencing has been proven tobe very useful when participants cannot physically meet in one location.Each participant joins the meeting through their information handlingsystem or device, such as a laptop computer. The devices includemicrophones that pick up/receive audio (e.g., participant talking),speakers that provide audio, cameras that capture video, anddisplays/screens that present images and video. As participants havebeen accustomed to remotely attend web based conferencing meetings, theyhave found out how useful participating through their devices can be.Participants are able to share with one another data and information ontheir devices, such as images, video, audio, files, etc.

Remote or web based conferencing works well when participants are notphysically with one another. When participants get together tophysically meet in one room, issues arise when participants desire touse their devices to join in the meeting. It can be desirable forparticipants to use their devices as discussed above; however, withdifferent participant devices having their own microphones pickingambient audio and speakers providing audio, there can be cross talk andinterference. In addition, there can be issues identifying whichparticipant is actually speaking at the time.

As the value and use of information continues to increase, individualsand businesses seek additional ways to process and store information. Aninformation handling system generally processes, compiles, stores, orcommunicates information or data for business, personal, or otherpurposes, thereby allowing users to take advantage of the value of theinformation. Because technology and information handling needs andrequirements vary between different users or applications, informationhandling systems may also vary regarding what information is handled,how the information is handled, how much information is processed,stored, or communicated, how quickly and efficiently the information maybe processed, stored, or communicated, and security of the informationprocessing, storage, or communication.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the present invention may be better understood byreferencing the accompanying drawings.

FIG. 1 is a general illustration of components of an informationhandling system as implemented in the present invention;

FIG. 2 is general illustration of a system for multiple devices to sharea speakerphone system in accordance with embodiments of the presentinvention;

FIG. 3 is a general illustration of a system for a conference room thatsupports multiple devices that share a speakerphone system in accordancewith embodiments of the present invention; and

FIG. 4 is a generalized flowchart for sharing a speakerphone system withmultiple devices in a conference room setting in accordance withembodiments of the present invention.

The use of the same reference symbols in different drawings indicatesidentical items unless otherwise noted. The figures are not necessarilydrawn to scale.

DETAILED DESCRIPTION

Embodiments of the present invention are intended to address the problemof using multiple devices that are using remote web conferencing servicein a physical location, such as a conference room. A speakerphone systemis shared with multiple devices. The devices are using the webconferencing service to conduct a meeting. The devices are particular toparticipants participating in the meeting who are using their devices inthe physical location (conference room). A participant who is activelyspeaking during the meeting, known as an active speaker, is identified.The device of the active speaker is chosen or switched to, allowing thespeakerphone system to receive and render audio of the device of theactive speaker. Video of the device of the active speaker is alsoenabled to allow the video to be displayed on all the devicesparticipating in the meeting.

FIG. 1 is a generalized illustration of an information handling system100 that can be used to implement the system and method of the presentinvention. The information handing system 100 can be a host to theperipheral devices described herein. Implementations provide for theinformation handling system 100 can include devices such as a notebookor laptop personal computer (PC) or a PC integrated into a keyboard, asmartphone, or other computing device as described herein.

The information handling system 100 includes a processor(s) (e.g.,central processor unit or “CPU”) 102. The information handling system100 includes input/output (I/O) devices 104, such as a microphone 106and a camera 108. Other I/O devices 104 include a speaker(s), akeyboard, a video/display, a mouse, and associated controllers (e.g.,K/V/M). The information handling system 100 further includes a harddrive or disk storage 110, and various other subsystems 112.

In various embodiments, the information handling system 100 alsoincludes network port 114 operable to connect to a network 140, wherenetwork 140 can include one or more wired and wireless networks, such asWiFi, and including the Internet. Network 140 is likewise accessible bya service provider server 142. The information handling system 100likewise includes system memory 116, which is interconnected to theforegoing via one or more buses 118. System memory 116 can beimplemented as hardware, firmware, software, or a combination of such.System memory 116 further includes an operating system (OS) 120.Embodiments provide for the system memory 116 to include applications122.

Implementations, provide for the applications 122 to include a webconference application 124 configured to connect to a remote or webvideo conferencing service as further described herein. Examples ofvideo conferencing services can include Zoom, Microsoft Teams, GoogleHangouts, Skype, Dialpad Meetings, TrueConf Online, Skype,FreeConference, Lifesize Go, Slack Video Calls, Facebook Live, YouTubeLive, etc.

Applications 122 can include a talker identification (ID) 126 that canbe part of a tracker ID system further described herein. Talker ID 126uses microphone 106 of input/output (I/O) devices 104 to receive voiceinput of a participant. Implementations provide for enrollment andrecognition of participant voices. Enrollment can include receiving afew seconds, such as three seconds, of a participant voice andassociating the particular participant voice to a unique signature. Invarious implementations, the unique signature is associated with aunique identifier, such as a “192” place value numerical identifier.

Talker ID 126 and talker ID system can be based on an active machinelearning model that processes audio to create the unique signatures thatare associated to participants. When a participant speaks into themicrophone 106, talker ID 126 is configured to recognize the enrolledidentified voice which is associated with a participant. Implementationsprovide for talker ID 126 to recognize the participant and the device(i.e., information handling system 100) that is being used.Implementations provide for information handling system 100 and talkerID 126 to enroll more than one participant.

Applications 122 can include active speaker recognition 128. When aparticipant speaks, they are considered as an active speaker. The activespeaker “controls” the audio of the meeting, where the active speaker isable to talk over other participants. Active speaker recognition 128detects (active speaker detection) and acknowledges the active speaker.Implementations include active speaker recognition 128 to provide forthe “reading” of lips of participants. Camera 108 of I/O devices 104captures video of participants. The video focusing on the lips of theparticipants. Using camera 108 and active speaker recognition 128, aparticipant lips are detected as moving. A participant whose lips aremoving is recognized as actively talking and identified as an activespeaker. For implementations where active speaker recognition is solelyused, no training or enrollment is involved, unlike when talker ID 126is used.

In various implementations, active speaker recognition 128 works withtalker ID 126 to identify the talking participant (active speaker),where the active speaker is recognized by active speaker recognition 128and talker ID 126. Talker ID 126 creates a unique signature for activespeakers who have not been previously enrolled.

Applications 122 includes a selector switch hub 130. The selector switchhub 130 is configured to go between devices of participants. In variousimplementations, the selector switch hub 130 selects and automaticallyswitches to a “best device”. The “best device” is the device whosemicrophone 106 is allowed to pick audio and through the web conferencingservice The audio is sent by the web conference service to aspeakerphone system supporting the meeting between participants. The“best device” is the device of the active speaker.

Implementations provide for information handling system 100, andparticularly hard drive/disk 110 to include a common dictionary 132. Asdiscussed, the talker ID 126 and talker ID system as further describedherein, is used to enroll unique signatures of voices of participants.Implementations provide for the enrolled signatures to be stored incommon dictionary 132. The common dictionary 132, along with activemachine learning model further described herein, is used to infer activespeaker during meetings. In certain implementations, a talker IDauto-pairing is performed. A prestored talker ID signature and a device(e.g., laptop) information pair is created, associating participantswith devices (e.g., laptops). Such talker ID-device pairs can be storedin common dictionary 132. In various implementations, talker ID-devicepairs can be used when joining meetings. In various implementations,when a device (e.g., laptop) identifies that the participant is talking,selector switch hub 130 detects the that the same participant istalking, and it is determined that the device (e.g., laptop) is in thesame room (e.g., conference room) as the participant.

In certain implementations, relevant entries are extracted from commondictionary 128 for a specific meetings, considering that numerousentries can be accumulated in common dictionary, and only relevantentries (e.g., participants) are used for a specific meeting. This canreduce computational cost and increase accuracy. Implementations providefor subsequent meetings to use the same data/information in the commondictionary 128.

FIG. 2 is general illustration of a system for multiple devices to sharea speakerphone system 200 to implement the methods of the presentinvention. The system 200 includes the network 140. As discussed,network 140 can include wired and wireless networks, including theInternet. Network 140 provides interconnectivity between the elements ofsystem 200.

The system 200 includes participant devices 202. Implementations providefor the participant devices 202 to be configured as an informationhandling system 100 described in FIG. 1 . The participant devices 202are used by and associated with participants in a meeting in a physicallocation, such as conference room 204. The participant devices use aspeakerphone system 206 located in conference room 204. Implementationsprovide for the speakerphone system 206 to pick up audio and render theaudio to the web conferencing service 208, as further described herein.

The participant devices 202 and speakerphone system 206 connect to anduse a web conferencing service 208. As discussed, examples of webconferencing services can include Zoom, Microsoft Teams, GoogleHangouts, Skype, Dialpad Meetings, TrueConf Online, Skype,FreeConference, Lifesize Go, Slack Video Calls, Facebook Live, YouTubeLive, etc. Implementations provide that one web conferencing service isused for participant meetings. As discussed, participant devices accessa web conferencing service 208, when a meeting takes place.

In various implementations, the system 200 includes a talker ID system210. Embodiments provide for the talker ID system 210 to be configuredas an information handling system described in FIG. 1 . It is also to beunderstood that the talker ID system 210 can be implemented in variouscomputing platforms, including cloud computing. Implementations includethe talker ID system 210 to be configured as an edge device as part of alocal network (e.g., within conference room 205). Implementationsprovide for edge devices to be configured as conference devices, such asvideo bars. Implementations also include talker ID system 210 to beconfigured as dedicated server or servers.

Implementations provide for the talker ID system 210 to be communicateand receive data/information from the Talker ID application 122 ofparticipant devices 206. As discussed, enrollment of participants can beimplemented, where a participant’s voice is received and a uniquesignature is associated with the participant’s voice. An active machinelearning model 212 processes the audio to create the unique signatures.

In various implementations, the system 200 includes a common dictionary214. As discussed, a common dictionary 128 can be included in aninformation handling system 100; however, common dictionary 214 can alsobe provided in remote store. As discussed, common dictionary 214 can beused to store the enrolled unique participant signatures. The commondictionary 214, along with active machine learning model 212 can be usedto infer an active speaker during meetings. As discussed, talker IDauto-pairing is performed. Talker ID-device pairs can be stored incommon dictionary 214.

As discussed, in certain implementations, relevant entries can beextracted from common dictionary 132 for specific meetings, which alsoapplies to the common dictionary 214. Implementations can also providefor subsequent meetings to use the same data/information in the commondictionary 214.

FIG. 3 is a general illustration of a system 300 for conference room 204that supports multiple participant devices 202 that share speakerphonesystem 206. Multiple participants 302-1 to 302-N physically gather for ameeting in conference room 204 or other physical location. Eachparticipant 302 has their own participant device 202, which can includelaptop computers/laptops. Implementations provide for the participantdevices 202 to have a camera view and web conference account or webconference application 124 that connects to web conferencing service 208via respective connectivity 304. Connectivity 304 can be through WiFi,WLAN, other wired/wireless communications, including a combination ofsuch, including network 140.

Although, the participant devices 202 are configured with microphones106 and speakers (i.e., I/O devices 104), such microphones 106 andspeakers can be different from one another, and have varyingsensitivity, frequency response, output, etc. Therefore, in variousimplementations, the speakerphone system 208 is used to pick up audiofrom participants 302, instead of the microphones 106 of the participantdevices 202. The audio is sent from the speakerphone system 206 to theweb conferencing service 208, where speakerphone system 206 is connectedto web conferencing service 208 by connectivity 306. Connectivity 306can be through WiFi, WLAN, other wired/wireless communications,including a combination of such, including network 140. Implementationsprovide for the web conferencing service 208 to provide audio (i.e.,meeting audio) to the speakerphone system 206.

Implementations provide for the participant devices 202 to transmitvideo to the web conferencing service 208. In particular, video feed ofa participant device 202 of an identified active talker is provided bythe web conferencing service 208. In other words, an active talker isidentified, and only the video feed of the participant device 202 to webconferencing service 208 of that active talker is provided by the webconferencing service 208. Web conferencing service 208 sends the videoto all participant devices 202 to be displayed as part of the meeting..Each participant 302 receives the video on their respective participantdevices 202

The participant devices 202 are connected to the speakerphone system 206by respective connectivity 308. In various embodiments, speakerphonesystem 206 is configured as a computing device, such as an informationhandling system 100 described in FIG. 1 . Various configurations includethe speakerphone system 206 to be implemented as a video bar, videoconferencing system, conference phone, etc.

Implementations provide for the speakerphone system 208 to include theapplications talker ID 122 to identify an active speaker, and selectorswitch hub 126 to select a participant device. When speakerphone system208 is configured with selector switch hub 126, speakerphone system 208performs the switching between active speakers (i.e., participants 300).In other implementations, one of the participant devices 206 isconfigured with the selector switch hub 126 and performs the switchingbetween active speakers (i.e., participants 300). Implementationsprovide for the speakerphone system 208 to connect with this participantdevice 206 and the participant devices 206 by USB, or otherwired/wireless connections.

As discussed, the selector switch hub 126 is configured to automaticallyswitch between participant devices 202. In particular, the selectorswitch hub 126 switches to a “best device”. When the active talker isrecognized by either talker ID 122 or talker ID system 210, theparticipant device 202 of that active talker is identified as the “bestdevice.” In other words, the “best device” is identified based on whichparticipant that is actively speaking. In various implementations, the“best device” is the device whose microphone 106 is allowed to pickaudio and through the web conferencing service, the audio is rendered toa speakerphone system supporting the meeting between participants. The“best device” is the device of the active speaker.

Alternatively, as described, instead of using talker identificationwhich requires training, which a participant may not desire, videodetection of synchronization of lip movement can decide whichparticipant device 202 has an active talker in front of the participantdevice 202. For this approach, no training/enrollment is required.

FIG. 4 is a generalized flowchart for sharing a speakerphone system withmultiple devices in a conference room setting. The order in which themethod is described is not intended to be construed as a limitation, andany number of the described method steps may be combined in any order toimplement the method, or alternate method. Additionally, individualsteps may be deleted from the method without departing from the spiritand scope of the subject matter described herein. Furthermore, themethod may be implemented in any suitable hardware, software, firmware,or a combination thereof, without departing from the scope of theinvention.

At step 402, the process 400 starts. At step 404, identifying isperformed of an active speaker of a group of participants in a physicalmeeting. The physical meeting is in a conference room or similarlocation where the group of participants are gathered. The group ofparticipants participate in a videoconferencing meeting using theirparticular participant devices 202, and access a web conferencingservice 208 as described herein. The identifying is performed usingtalker ID 126, talker ID system, or active speaker recognition 128.

At step 406, choosing the participant device 202 of the identifiedactive speaker is performed. In particular, the chosen participantdevice 202 is the participant device 202 identified by the webconferencing service 208 to provide audio and video feed to the webconferencing service 208. The web conferencing service 208 sends audioto be played on the speakerphone system 206, and video to be displayedon the participant devices 202. The switching can be performed by theselector switch hub 130 described herein.

At step 408, capturing/transmitting active speaker audio andreceiving/rendering far-end audio is performed. The active speaker’saudio is captured by a microphone of the speakerphone system 208 androuted by selector switch hub 126 to the active speaker’s device 206.The active speaker’s device 206 sends the audio to far-end by the webconferencing service 202. The far-end audio is received by activetalker’s device 206 from the web conferencing service 202, and routed byselector switch hub 126 to the speakerphone system 208, and renderthrough a speaker of speakerphone system 208.

At step 410, enabling video of the participant device 206 is performedto allow the video to be shown by the web conferencing service 202. Eachparticipant device 206 has their own video; however, the video of theparticipant device 206 of the active speaker is displayed/shown by theweb conferencing service 202. At step 412, the process 500 ends.

Although the invention is described herein with reference to specificembodiments, various modifications and changes can be made withoutdeparting from the scope of the present invention as set forth in theclaims below. For example, the number of machine-learning models andassociated granularities used and the nature of the applicationgenerating the well-defined normal behavior data. Accordingly, thespecification and figures are to be regarded in an illustrative ratherthan a restrictive sense, and all such modifications are intended to beincluded within the scope of the present invention. Any benefits,advantages, or solutions to problems that are described herein withregard to specific embodiments are not intended to be construed as acritical, required, or essential feature or element of any or all theclaims.

Furthermore, the terms “a” or “an,” as used herein, are defined as oneor more than one. Also, the use of introductory phrases such as “atleast one” and “one or more” in the claims should not be construed toimply that the introduction of another claim element by the indefinitearticles “a” or “an” limits any particular claim containing suchintroduced claim element to inventions containing only one such element,even when the same claim includes the introductory phrases “one or more”or “at least one” and indefinite articles such as “a” or “an.” The sameholds true for the use of definite articles.

For purposes of this disclosure, an information handling system mayinclude any instrumentality or aggregate of instrumentalities operableto compute, classify, process, transmit, receive, retrieve, originate,switch, store, display, manifest, detect, record, reproduce, handle, orutilize any form of information, intelligence, or data for business,scientific, control, gaming, or other purposes. For example, aninformation handling system may be a personal computer, a networkstorage device, or any other suitable device and may vary in size,shape, performance, functionality, and price. The information handlingsystem may include random access memory (RAM), one or more processingresources such as a central processing unit (CPU) or hardware orsoftware control logic, ROM, and/or other types of nonvolatile memory.Additional components of the information handling system may include oneor more disk drives, one or more network ports for communicating withexternal devices as well as various input and output (I/O) devices, suchas a microphone, keyboard, a video display, a mouse, etc. Theinformation handling system may also include one or more buses operableto transmit communications between the various hardware components.

Unless stated otherwise, terms such as “first” and “second” are used toarbitrarily distinguish between the elements such terms describe. Thus,these terms are not necessarily intended to indicate temporal or otherprioritization of such elements.

As will be appreciated by one skilled in the art, the present inventionmay be embodied as a method, system, or computer program product.Accordingly, embodiments of the invention may be implemented entirely inhardware, entirely in software (including firmware, resident software,micro-code, etc.) or in an embodiment combining software and hardware.These various embodiments may all generally be referred to herein as a“circuit,” “module,” or “system.” Furthermore, the present invention maytake the form of a computer program product on a computer-usable storagemedium having computer-usable program code embodied in the medium.

Any suitable computer usable or computer readable medium may beutilized. The computer-usable or computer-readable medium may be, forexample, but not limited to, an electronic, magnetic, optical,electromagnetic, infrared, or semiconductor system, apparatus, ordevice. More specific examples (a non-exhaustive list) of thecomputer-readable medium would include the following: a portablecomputer diskette, a hard disk, a random-access memory (RAM), aread-only memory (ROM), an erasable programmable read-only memory (EPROMor Flash memory), a portable compact disc read-only memory (CD-ROM), anoptical storage device, or a magnetic storage device. In the context ofthis document, a computer-usable or computer-readable medium may be anymedium that can contain, store, communicate, or transport the programfor use by or in connection with the instruction execution system,apparatus, or device.

Computer program code for carrying out operations of the presentinvention may be written in an object-oriented programming language suchas Java, Smalltalk, C++ or the like. However, the computer program codefor carrying out operations of the present invention may also be writtenin conventional procedural programming languages, such as the “C”programming language or similar programming languages. The program codemay execute entirely on the user’s computer, partly on the user’scomputer, as a stand-alone software package, partly on the user’scomputer and partly on a remote computer or entirely on the remotecomputer or server. In the latter scenario, the remote computer may beconnected to the user’s computer through a local area network (LAN) or awide area network (WAN), or the connection may be made to an externalcomputer (for example, through the Internet using an Internet ServiceProvider).

Embodiments of the invention are described with reference to flowchartillustrations and/or step diagrams of methods, apparatus (systems) andcomputer program products according to embodiments of the invention. Itwill be understood that each step of the flowchart illustrations and/orstep diagrams, and combinations of steps in the flowchart illustrationsand/or block diagrams, can be implemented by computer programinstructions. These computer program instructions may be provided to aprocessor of a general-purpose computer, special purpose computer, orother programmable data processing apparatus to produce a machine, suchthat the instructions, which execute via the processor of the computeror other programmable data processing apparatus, create means forimplementing the functions/acts specified in the flowchart and/or blockdiagram step or steps.

These computer program instructions may also be stored in acomputer-readable memory that can direct a computer or otherprogrammable data processing apparatus to function in a particularmanner, such that the instructions stored in the computer-readablememory produce an article of manufacture including instruction meanswhich implement the function/act specified in the flowchart and/or blockdiagram block or blocks.

The computer program instructions may also be loaded onto a computer orother programmable data processing apparatus to cause a series ofoperational steps to be performed on the computer or other programmableapparatus to produce a computer implemented process such that theinstructions which execute on the computer or other programmableapparatus provide steps for implementing the functions/acts specified inthe flowchart and/or block diagram block or blocks.

The present invention is well adapted to attain the advantages mentionedas well as others inherent therein. While the present invention has beendepicted, described, and is defined by reference to particularembodiments of the invention, such references do not imply a limitationon the invention, and no such limitation is to be inferred. Theinvention is capable of considerable modification, alteration, andequivalents in form and function, as will occur to those ordinarilyskilled in the pertinent arts. The depicted and described embodimentsare examples only and are not exhaustive of the scope of the invention.

Consequently, the invention is intended to be limited only by the spiritand scope of the appended claims, giving full cognizance to equivalentsin all respects.

What is claimed is:
 1. A computer-implementable method for sharing aspeakerphone system (206) with multiple participant devices (202)comprising: identifying an active speaker of a group of participants(302) in a physical meeting setting (204) using a web conferencingservice (208); choosing a participant device (202) of the identifiedactive speaker; sending audio of the identified active speaker to theweb conferencing service (208), wherein the web conference service (208)provides audio to the speakerphone system (206); and enabling videocaptured by the participant device (202) to be shown by the webconferencing service (208) on all participant devices (202) of the groupof participants (302).
 2. The computer-implementable method of claim 1,wherein the identifying comprises enrolling voice signatures ofparticipants; and identifying the active speaker using an enrolled voicesignature.
 3. The computer-implementable method of claim 2, wherein theenrolled voice signatures are associated with participant devices. 4.The computer-implementable method of claim 2, wherein the enrolled voicesignatures are stored in common dictionary configured to be accessed byeach participant device.
 5. The computer-implementable method of claim1, wherein the identifying is performed capturing voice of an activespeaker, generating a voice signature based on the captured voice, andcomparing the generated signature to an enrolled voice signature.
 6. Thecomputer-implementable method of claim 2, wherein the enrolled voicesignatures are stored in common dictionary configured to be accessed byeach participant device.
 7. The computer-implementable method of claim1, wherein the identifying is performed by active speaker recognitionrecognizing movement of lips of the active speaker.
 8. Thecomputer-implementable method of claim 1, wherein the choosing isperformed by a participant device.
 9. The computer-implementable methodof claim 1, wherein the choosing is performed by the speakerphonesystem.
 10. A system (100, 202, 210) comprising: a processor; a data buscoupled to the processor; and a non-transitory, computer-readablestorage medium embodying computer program code, the non-transitory,computer-readable storage medium being coupled to the data bus, thecomputer program code interacting with a plurality of computeroperations for sharing a speakerphone system with multiple participantdevices and comprising instructions executable by the processor andconfigured for: identifying an active speaker of a group of participants(302) in a physical meeting setting (204) using a web conferencingservice (208); choosing a participant device (202) of the identifiedactive speaker; sending audio of the identified active speaker to theweb conferencing service (208), wherein the web conference service (208)provides audio to the speakerphone system (206); and enabling videocaptured by the participant device (202) to be shown by the webconferencing service (208) on all participant devices (202) of the groupof participants (302).
 11. The system of claim 10, wherein theidentifying comprises enrolling voice signatures of participants andidentifying the active speaker through an enrolled voice signature. 12.The system of claim 10, wherein the identifying is performed by activespeaker recognition recognizing movement of lips of the active speaker.13. The system of claim 10, wherein the choosing is performed by thespeakerphone system.
 14. A non-transitory, computer-readable storagemedium (para [0047]) embodying computer program code for sharing aspeakerphone system with multiple participant devices, the computerprogram code comprising computer executable instructions configured for:identifying an active speaker of a group of participants (302) in aphysical meeting setting (204) using a web conferencing service (208);choosing a participant device (202) of the identified active speaker;sending audio of the identified active speaker to the web conferencingservice (208), wherein the web conference service (208) provides audioto the speakerphone system (206); and enabling video captured by theparticipant device (202) to be shown by the web conferencing service(208) on all participant devices (202) of the group of participants(302).
 15. The non-transitory, computer-readable storage medium of claim14, wherein the identifying comprises enrolling voice signatures ofparticipants and identifying the active speaker through an enrolledvoice signature.
 16. The non-transitory, computer-readable storagemedium of claim 15, wherein the enrolled voice signatures are associatedwith participant devices.
 17. The non-transitory, computer-readablestorage medium of claim 15, wherein the enrolled voice signatures arestored in common dictionary.
 18. The non-transitory, computer-readablestorage medium of claim 14, wherein the identifying is performed byactive speaker recognition recognizing movement of lips of the activespeaker.
 19. The non-transitory, computer-readable storage medium ofclaim 14, wherein the wherein the switching is performed by aparticipant device.
 20. The computer-implementable method of claim 14,wherein the switching is performed by the speakerphone system.